SemanticScuttle - klotz.me » Tags: instruction tuning+llm

Tags: instruction tuning* + llm*

0 bookmark(s) - Sort by: Date ↓ / Title /

A brief summary of language model finetuning

This article summarizes various techniques and goals of language model finetuning, including knowledge injection and alignment, and discusses the effectiveness of different approaches such as instruction tuning and supervised fine-tuning.

2024-11-01 Tags: llm, finetuning, instruction tuning, knowledge injection, alignment, supervised fine-tuning, relief by klotz

RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs

A method that uses instruction tuning to adapt LLMs for knowledge-intensive tasks. RankRAG simultaneously trains the models for context ranking and answer generation, enhancing their retrieval-augmented generation (RAG) capabilities.

2024-07-10 Tags: natural language processing, large language models, instruction tuning, context ranking, retrieval-augmented generation, nvidia, arxiv by klotz

NVIDIA Introduces RankRAG: Enhancing LLMs with Instruction Tuning

NVIDIA and Georgia Tech researchers introduce RankRAG, a novel framework instruction-tuning a single LLM for top-k context ranking and answer generation. Aiming to improve RAG systems, it enhances context relevance assessment and answer generation.

2024-07-10 Tags: rankrag, nvidia, llm, rag, instruction tuning, natural language processing by klotz

MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning

This paper proposes a new method called MoRA for parameter-efficient fine-tuning of large language models (LLMs). The proposed method, MoRA, employs a square matrix to achieve high-rank updating, maintaining the same number of trainable parameters. The paper suggests that low-rank updating, as implemented in LoRA, may limit the ability of LLMs to effectively learn and memorize new knowledge. MoRA outperforms LoRA on memory-intensive tasks and achieves comparable performance on other tasks.

2024-05-26 Tags: llm, parameter-efficient fine-tuning, mora, high-rank updating, lora, instruction tuning, mathematical reasoning, continual pretraining, memory, pretraining, sebastian reschka, microsoft research by klotz

NVIDIA AI Introduces ChatQA: A Family of Conversational Question Answering (QA) Models that Obtain GPT-4 Level Accuracies

ChatQA, a new family of conversational question-answering (QA) models developed by NVIDIA AI. These models employ a unique two-stage instruction tuning method that significantly improves zero-shot conversational QA results from large language models (LLMs). The ChatQA-70B variant has demonstrated superior performance compared to GPT-4 across multiple conversational QA datasets.

2024-01-24 Tags: llm, instruction tuning, nvidia, chatqa, sft by klotz

First / Previous / Next / Last / Page 1 of 0

SemanticScuttle - klotz.me

Tags: instruction tuning* + llm*

Linked Tags

Related Tags